HOO 2012 Error Recognition and Correction Shared Task: Cambridge University Submission Report
Abstract
Previous work on automated error recognition and correction of texts written by learners of English as a Second Language has demonstrated experimentally that training classifiers on error-annotated ESL text generally outperforms training on native text alone and that adaptation of error correction models to the native language (L1) of the writer improves performance. Nevertheless, most extant models have poor precision, particularly when attempting error correction, and this limits their usefulness in practical applications requiring feedback. We experiment with various feature types, varying quantities of error-corrected data, and generic versus L1-specific adaptation to typical errors using Naïve Bayes (NB) classifiers and develop one model which maximizes precision. We report and discuss the results for 8 models, 5 trained on the HOO data and 3 (partly) on the full error-coded Cambridge Learner Corpus, from which the HOO data is drawn.
Similar resources
The UI System in the HOO 2012 Shared Task on Error Correction
We describe the University of Illinois (UI) system that participated in the Helping Our Own (HOO) 2012 shared task, which focuses on correcting preposition and determiner errors made by non-native English speakers. The task consisted of three metrics: Detection, Recognition, and Correction, and measured performance before and after additional revisions to the test data were made. Out of 14 team...
Informing Determiner and Preposition Error Correction with Word Clusters
We extend our n-gram-based data-driven prediction approach from the Helping Our Own (HOO) 2011 Shared Task (Boyd and Meurers, 2011) to identify determiner and preposition errors in non-native English essays from the Cambridge Learner Corpus FCE Dataset (Yannakoudakis et al., 2011) as part of the HOO 2012 Shared Task. Our system focuses on three error categories: missing determiner, incorrect de...
Informing Determiner and Preposition Error Correction with Hierarchical Word Clustering
We extend our n-gram-based data-driven prediction approach from the Helping Our Own (HOO) 2011 Shared Task (Boyd and Meurers, 2011) to identify determiner and preposition errors in non-native English essays from the Cambridge Learner Corpus FCE Dataset (Yannakoudakis et al., 2011) as part of the HOO 2012 Shared Task. Our system focuses on three error categories: missing determiner, incorrect de...
Korea University System in the HOO 2012 Shared Task
In this paper, we describe the Korea University system that participated in the HOO 2012 Shared Task on the correction of preposition and determiner errors in non-native speaker texts. We focus our work on training the system on a large collection of error-tagged texts provided by the HOO 2012 Shared Task organizers and incrementally applying several methods to achieve better performance.
A Naive Bayes classifier for automatic correction of preposition and determiner errors in ESL text
This is the report for the CNGL ILT3 team entry to the HOO shared task. A Naive-Bayes-based classifier was used in the task, which involved error detection and correction in ESL exam scripts. Our system placed 11th out of 14 teams for the detection and recognition tasks and 11th out of 13 teams for the correction task, based on F-score for both preposition and determiner errors.